Cross-validated Bandwidths and Significance Testing
نویسندگان
چکیده
The development of an asymptotic theory relating to the size of a bandwidth of a variable that is entered into a nonparametric regression is becoming well understood. However, the connections between variables that are smoothed away and those that are tested for significance has not been previously studied. This paper proposes a variety of simulation exercises to examine the performance of both cross-validated bandwidths and individual and joint tests of significance. We focus on settings where the hypothesis of interest focuses only on continuous variables, only on discrete variables, and finally a mix of discrete and continuous variables and uses tests that can handle either data type individually as well as jointly. Our results suggest that individual tests of significance and variable specific bandwidths are very close in performance, but joint tests and joint bandwidth recognition produce substantially different results. This underscores the importance of testing for joint significance when one is trying to arrive at the final nonparametric model of interest.
منابع مشابه
Plug-in Bandwidth Selection for Kernel Density Estimation with Discrete Data
This paper proposes plug-in bandwidth selection for kernel density estimation with discrete data via minimization of mean summed square error. Simulation results show that the plug-in bandwidths perform well, relative to cross-validated bandwidths, in non-uniform designs. We further find that plug-in bandwidths are relatively small. Several empirical examples show that the plug-in bandwidths ar...
متن کاملFriedman’s Super Smoother
Friedman’s super smoother is a nonparametric regression estimator based on local linear regression with adaptive bandwidths (Friedman [1984]). The basic idea is to first estimate a number of fixed bandwidth smooths by local linear regression. The leave-one-out cross-validated residuals from each of those initial estimates are then smoothed using a constant bandwidth. Based on the smoothed resid...
متن کاملBiased binomial assessment of cross-validated estimation of classification accuracies illustrated in diagnosis predictions
Multivariate classification is used in neuroimaging studies to infer brain activation or in medical applications to infer diagnosis. Their results are often assessed through either a binomial or a permutation test. Here, we simulated classification results of generated random data to assess the influence of the cross-validation scheme on the significance of results. Distributions built from cla...
متن کاملPerformance Evaluation and Experiment of a Configuration Algorithm for Three-Stage Multi-Granularity Optical Cross-Connects
We study the configuration issue of three-stage multigranularity optical cross-connects (MG-OXC) for the dynamic traffic model in all-optical networks. From the single node point of view, we propose a configuration algorithm to configure different granularity crossconnects for arrival sub-requests with different traffic types and bandwidths. The performance of the configuration algorithm is eva...
متن کاملA Model-Based Approach for Variable Bandwidth Selection in Kernel Density Estimation
A new procedure is proposed for deriving variable bandwidths in univariate kernel density estimation. Rather than concentrate on minimising some criterion based upon the mean integrated square error (MISE) or related quantity, we build models for the data and use exact calculation or sampling methods to make inferences about the bandwidths as appropriate. These Bayesian models are based on a cr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008